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Abstract 

Background: Transmission models can aid understanding of disease dynamics and are useful in testing the efficiency 
of control measures. The aim of this study was to formulate an appropriate stochastic Susceptible-lnfectious-Resistant/ 
Carrier (SIR) model for Salmonella Typhimurium in pigs and thus estimate the transmission parameters between states. 

Results: The transmission parameters were estimated using data from a longitudinal study of three Danish 
farrow-to-finish pig herds known to be infected. A Bayesian model framework was proposed, which comprised 
Binomial components for the transition from susceptible to infectious and from infectious to carrier; and a Poisson 
component for carrier to infectious. Cohort random effects were incorporated into these models to allow for 
unobserved cohort-specific variables as well as unobserved sources of transmission, thus enabling a more realistic 
estimation of the transmission parameters. In the case of the transition from susceptible to infectious, the cohort 
random effects were also time varying. The number of infectious pigs not detected by the parallel testing was 
treated as unknown, and the probability of non-detection was estimated using information about the sensitivity 
and specificity of the bacteriological and serological tests. The estimate of the transmission rate from susceptible 
to infectious was 0.33 [0.06, 1.52], from infectious to carrier was 0.18 [0.14, 0.23] and from carrier to infectious 
was 0.01 [0.0001, 0.04]. The estimate for the basic reproduction ration (R 0 ) was 1.91 [0.78, 5.24]. The probability of 
non-detection was estimated to be 0.18 [0.12, 0.25]. 

Conclusions: The proposed framework for stochastic SIR models was successfully implemented to estimate 
transmission rate parameters for Salmonella Typhimurium in swine field data. R 0 was 1.91, implying that there was 
dissemination of the infection within pigs of the same cohort. There was significant temporal-cohort variability, 
especially at the susceptible to infectious stage. The model adequately fitted the data, allowing for both observed 
and unobserved sources of uncertainty (cohort effects, diagnostic test sensitivity), so leading to more reliable estimates 
of transmission parameters. 
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Background 

Salmonella Typhimurium is one of the major food- 
borne pathogens currently causing disease in humans 
[1] and it is often related to consumption of pork 
products. Given its relevance to consumer food safety, 
Salmonella spp. control was considered necessary by 
the European food safety policy makers under the EC 
Regulation 2160/2003. In the near future, it is possible 
that a mandatory target reduction will be put in place 
in the European Union, regarding the Salmonella preva- 
lence for pigs. 

In practice, however, the control of this agent has 
proved to be difficult and expensive at the farm level [2]. 
Consequently, evaluating the efficiency of control strat- 
egies relating to this agent has become an important 
issue, as stated in recent reports [3]. Modelling the dy- 
namics of Salmonella Typhimurium in pigs is important 
in evaluating alternative control strategies. The basic 
reproduction ratio (R 0 ) expresses the number of second- 
ary cases to which a primary case gives rise during the 
infectious period and provides valuable information for 
simulation models. If Ro is less than unity then the dis- 
ease is receding, but when it is higher than unity the dis- 
ease is spreading. 

Susceptible - Infectious - Resistant (SIR) models are 
attractive tools that aid understanding of disease dynam- 
ics. The SIR model formulates the changes of individuals 
in the population between different disease states in 
terms of a system of ordinary differential equations 
(ODE), known as the Kermack-McKendrick ODE model 
[4]. The variables in the system are given by the three 
components groups: susceptible (S), infectious (I) and 
carriers (R). SIR models include a mathematical specifica- 
tion of the movement into and out of the three compo- 
nents. The key parameter in each of these mathematical 
specifications is the transition rate: from S to I (|3), from I 
to R (a) and from R to I (v). If such modelling is to be 
helpful in infectious disease control, it is critical to have 
the best possible estimates of these rates (|3, a and v), 
as all three are important for modelling the spread of 
infection. Transmission data, generated under controlled 
conditions (i.e. experimental studies or controlled field 
studies) are necessary to estimate the transition rates as 
accurately as possible. 

In most cases Salmonella Typhimurium causes sub- 
clinical infection in swine with no apparent symptoms of 
disease. This makes it difficult to assess the infection sta- 
tus of individual pigs in an infected population without 
testing each animal several times. One of the difficulties 
in obtaining accurate estimates for (3 in Salmonella 
Typhimurium studies stems from the fact that the cur- 
rently available bacteriological and serological tests used 
to assign the infection status are imperfect, introducing 
uncertainty when trying to classify each animal. Yet 



another source of uncertainty comes from the fact that 
pigs, once infected, shed the agent intermittently. 

In the literature, it is conventional to use generalised 
linear models (GLMs) to describe the counts of animals 
moving between states (e.g. from S to I) using either 
Poisson [5-8] or Binomial distributions [4,9]. GLMs can 
be used to estimate all three transmission parameters 
although they lack flexibility, for example in capturing 
the effect of sensitivity and specificity of the diagnostic 
test used. GLMs also lack the flexibility to allow for un- 
observed effects of variables not recorded in the data, 
but which influence the outcomes. The Bayesian frame- 
work proposed in this paper is flexible enough to incorp- 
orate such effects and also quantify the uncertainty due 
to imperfect diagnostic tests. 

Following cohorts of animals in order to determine 
the dynamics of S. Typhimurium in susceptible popula- 
tions is a very expensive procedure and only a few such 
studies exist. In this paper, we use data from a previous 
observational study designed and performed by Kranker 
etal. [10]. 

A Bayesian modelling framework was proposed and 
used to estimate transmission parameters (transition rate 
from S to I, transition rate from I to R and transition 
rate from R to I) for Salmonella Typhimurium in pig 
herds, using the longitudinal data from Kranker et al. 
[10]. The sensitivity and specificity of the tests used to 
classify the animals in the Kranker study were allowed 
for in the statistical model, which also incorporated ran- 
dom effects to allow for cohort heterogeneity. 

Methods 

Study herds, sampling, bacteriology and ELISA test 

The data used have been previously described by Kranker 
et al. [10] and originate from three Danish pig herds 
known to be infected with Salmonella Typhimurium. The 
herds had moderate to high levels of Salmonella Typhi- 
murium and therefore the within-herd prevalence was 40% 
or higher. These measures of prevalence were based on 
meat-juice samples collected over three months, evaluated 
by use of an optical density (OD) cut-off of 20%. Two of 
the farms, with 650 and 440 sows, respectively, were two- 
site operations; the remaining farm was a three-site oper- 
ation with 300 sows. The three herds were self-supplying. 
In each herd, 10 litters were randomly selected, and in 
each litter, the ears of six randomly selected piglets were 
tagged. To account for variations in Salmonella shedding 
over time, litters from each herd were divided into two 
groups of five litters that were raised at approximately 
one-month intervals. Thus, on each farm there were two 
cohorts consisting of 30 pigs each, yielding a total of 180 
piglets at the start of the study. All ear-tagged pigs from 
a given cohort were raised together for the entire obser- 
vation period. The animals were followed longitudinally 
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[10] and were first tested at the age of four weeks and 
thereafter at two to five week intervals until the age of 
slaughter. Slaughter age varied between cohorts but was 
on average around 25 weeks. Cohorts were tested either 
six to seven times in total. On each testing occasion, sera 
and faeces from the animals were collected and tested for 
the presence of Salmonella spp.. At the age of four weeks 
only faeces were collected, because maternal antibodies 
still present could give a false positive result. An animal 
was considered serologically positive, wherever the sero- 
logical test revealed a result of OD% >20, and bacterio- 
logically positive if Salmonella was isolated from the 
faeces. The serological test used at this cut-off value is 
considered to have a sensitivity of 68% and to be 100% 
specific [11]. The bacteriological test is considered to be 
100% specific and the sensitivity is around 30 to 55% 
[12]. These test characteristics were incorporated in the 
statistical model. 

Infection status of the pigs 

The testing time interval was different in each cohort, 
varying from two to five weeks. A homogenous dataset 
was derived by inferring the infection status of each pig, 
every two weeks. The time step of two weeks was chosen 
because on average it takes two weeks for an animal to 
test positive to serology after being infected. It was 
therefore assumed that an animal was infectious in the 
two weeks before being seropositive. The most likely 
infection status of each pig was determined for each 
two-week time step based on both faecal shedding and 
serology results from the sampling period closest to each 
time step. Each animal was categorized as susceptible (S), 
infectious (I) or carrier (R). An animal was considered 
susceptible if the agent was not present and the animal 
was at the risk of infection. An animal was considered 
infectious if it was shedding the agent and a potential 
source of infection to other animals. An animal that was 
infected with the agent but not shedding and therefore 
not able to infect other animals was considered a car- 
rier. In the absence of reasonable sensitivity of the bac- 
teriological culture method, serology offered an alternative 
and complementary way to assign the infection status of a 
pig and both methods were used in parallel to categorise 
the pigs' status. 

Pigs were attributed status S when there was no pres- 
ence of bacteria in the faecal samples and the OD% was 
below 20. Status I was assigned from the date when a 
pig was found to be bacteriologically positive until it was 
no longer positive by this testing method. Additionally, 
pigs were assigned to status I based on seroconversion. 
The beginning of the infectious period was set to two 
weeks prior to the recorded date of seroconversion 
[13,14] and the duration was set to four weeks, assuming 
that a pig would shed Salmonella spp. within an average 



of four weeks. This average period was based on experi- 
mental data regarding duration of the shedding period 
[13,15]. Thus, information was used from both tests 
in parallel for pig classification. Finally, status I was 
followed by status R and the pigs could return to status I 
if they were found to be culture positive later on during 
the study period. It was assumed that no pig would re- 
turn to the susceptible status after being infected, be- 
cause of the relative short life span of finisher pigs (after 
infection it takes around 112 days to clear the agent from 
the organs [14], and post-weaned pigs are generally 
slaughtered before this time). The following describes a 
particular example of how the classification was per- 
formed: if a pig was shedding at a specific testing time, 
it was considered infectious in the nearest bi-weekly 
time step, until it became bacteriologically negative, 
after which it was considered a carrier (in the nearest 
bi-weekly time step). If a pig was serologically positive, 
in the presence of a negative culture, it was considered 
infected and therefore classified as infectious for at 
least four weeks, from the nearest bi-weekly time step 
prior the testing time. If an animal was bacteriologic- 
ally and serologically negative, it was considered sus- 
ceptible. Given that testing of piglets was restricted to 
bacteriology (which has low sensitivity) at the begin- 
ning of the follow-up period, some piglets infected by 
the sow could have been erroneously classified as sus- 
ceptible. For this reason, the analysis in each cohort 
started at the time infected animals were first detected 
(by either serology or bacteriology). 

Estimation of the transmission parameters 

Conventionally, transmission parameters of infectious dis- 
ease, including Salmonella spp., in swine herds [14,16-21] 
are estimated using regression models. These are often 
based on data describing the prevalence of that disease 
in the country or region to which the particular study 
refers. As suggested in some studies [5,22,23], we first 
applied a stochastic SIR models in the form of General- 
ised Linear Models (GLMs) in order to estimate the 
three transmission parameters. However, preliminary re- 
sults (not reported here) suggested the presence of over- 
dispersion in the GLMs, hinting towards unobserved 
sources of variation in the data such as cohort heterogen- 
eity. Instead, we here report a framework for stochastic 
SIR models which i) extends the current GLM frame- 
work by including random effects, ii) is implemented 
using a Bayesian approach thus allowing incorporation of 
prior information (such as the sensitivity of Salmonella 
tests), iii) explicitly estimates the probability of not de- 
tecting infectious animals due to test sensitivity and 
iv) incorporates all sources of uncertainty/variation thus 
obtaining more realistic estimates of transmission parame- 
ters. As suggested by some authors [5], the inclusion of 
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random effects automatically accounts for overdispersion 
by inflating the variance of the response variable while at 
the same time allowing for cohort heterogeneity. 

Stochastic SIR models (and other variants such as SI 
or SIS models) are well-established in animal disease lit- 
erature [4,19-21]. The benefit of using a stochastic SIR 
model is that transmission parameters can be estimated 
using statistical modelling; here the conventional sto- 
chastic SIR model was extended by explicitly allowing 
for cohort variation and unobserved temporal effects. 
The three components of the stochastic SIR model are 
described in detail below. 

Transition from susceptible to infectious 

It was assumed that pigs become infected by "infectious 
contacts" defined as: either contact with other infected 
animals, or contact with their environment (rodents, 
contaminated muck or feed). The rate at which a given 
animal has infectious contacts was assumed i) to be con- 
stant in time and ii) proportional to the density of 
infectious animals [19], with a constant of proportionality 
|3, i.e. the transmission rate parameter. In other words, 
the infectious contacts per animal happen randomly in 
time so that their occurrence can be described by a Pois- 
son process. More precisely, the number of infectious 
contacts per animal in a period At is Poisson distributed 
with mean \ = P (I/N) At, where I is the number of infec- 
tious animals and N is the total number of animals, at 
the beginning of At. As such, the probability of no infec- 
tious contacts per animal in At is exp (-(3 (I/N) At), im- 
plying that the probability of infection in At is p = 1- exp 
(-P (I/N) At). This means that the number of new cases C 
at the end of At is Binomial with parameters S and p so 
that the mean of C is S*p. 

Here, the current established methodology was extended 
to allow for the fact that i) \ may vary in time due to ex- 
ogenous factors and ii) \ may vary across cohorts due to 
unobserved cohort effects. A random (scaling) effect exp 
(r jt ) was included, for the j th cohort at time t, to get X jt = P 
(I/N) exp (rj t ) At as the mean number of infectious con- 
tacts of a random animal, in herd j at time t. Note that At 
denotes the length of a time interval whereas t refers to ac- 
tual time. On average, exp (rj t ) was assumed to be equal to 
one, so that across all cohorts and time, the average trans- 
mission rate parameter is still p. By doing this, variation 
due to cohort or unknown temporal effects was explicitly 
modelled, which would otherwise contribute to the uncer- 
tainty in estimating p. 

All time intervals in the data are equal to two weeks 
so for clarity, At = 1 was set so that one time step At cor- 
responds to two weeks. This does not qualitatively affect 
the estimation of the transmission parameters. Because 
of the nature of the data, time t is now defined in 
discrete consecutive (bi-weekly) time steps. 



The model may be formulated as follows: 

Cj t ~ Binomial ^Sj t: Pj t ^ 

Pjt = l-exp{-/S(l )t -i/N )t -i) exp(rij t )} 

cloglog (p jt ) = log(jS) + Log(l jt -i)-log(Njt_i) + ri Jt 

(1) 

where: 

- Cj t denotes the number of new infectious animals in 
cohort (j) at the end of the time step (t),. 

- Sjt-i is the number of susceptible animals in cohort 
(j) at the end of the time step (t-1), 

- Pj t is the probability of a susceptible animal in 
cohort (j) at the end of time step (t-1) becoming 
infectious by the end of time step (t), 

- cloglog is the complementary log-log transformation, 

- P is the transmission rate parameter for the 
transition from susceptible to infectious, 

- lj t ^j is the number of infectious animals in cohort (j) 
at the end of the time step (t-1), 

- Np-i is the total number of animals in cohort (j) at 
the end of the time step (t-1), and. 

- Tyt is a cohort time-dependent random effect (which 
is zero on average). 

Note that, at the beginning of the study, pigs were 
considered to be either in the S or I status depending on 
the test results. When there was no infectious pig 
present at the end of the previous time step, i.e. Ij t -i = 0, 
the probability of becoming infectious was modelled as: 

Cloglog (pj t ) = log (P) + r ljt . This is because even if there 
are no infectious pigs present, animals can still be infected 
(e.g., contaminated environment, feed, water, etc.). In this 
formulation, p is seen as the underlying rate of transition 
for a random pig in an average cohort with no infectious 
animals, while r^t allows for unobserved cohort-time ef- 
fects in the data e.g., anthropogenic influence, rodents etc. 
Note that homogeneous mixing of the pigs in each cohort 
(i.e. all pigs could come into contact with each other) was 
assumed, due to the small size of the cohorts. 

In using the number of infectious pigs Ij t , in each co- 
hort at the end of time step t, it was necessary to ac- 
count for the sensitivity of both the serological and 
bacteriological test. Since the specificity in both tests is 
considered to be 100%, the parallel specificity is 1. This 
implies that Ij t = Iobs jt + Inob jt , where Iobsj t is the observed 
value and Inobj t is the number of infectious animals not 
detected (false negative pigs). In other words, Iobs^ is a 
lower bound on the actual I jt . The unobserved variable 
Inobj t may be incorporated (and thus estimated) in the 
stochastic model and here it was assumed that it has a Bi- 
nomial distribution with parameters Nj t and pND where 
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pND is the probability of not detecting infectious animals. 
This probability, pND, is of course dependent on the sen- 
sitivity probabilities of each test, which were assumed to 
be independent. Inobj t was modelled as follows: 

Ijt = Iobsjt + Inobjt 

Inobjt ~ Binomial (Nj t , pND) (2) 
pND = (l-SenC)*(l-SenE) 

where: 

- SenC is the sensitivity probability of microbiological 
culture, and. 

- SenE is the sensitivity probability of the ELISA test. 

Treating Inobj t as an unobserved random variable allows 
formal quantification of the uncertainty in the data due to 
test sensitivity and constitutes one of the novelties of the 
proposed model. The Bayesian framework (see section 5 
later on) used to estimate the stochastic SIR model can 
easily incorporate the estimation of Inobj t given prior infor- 
mation on SenC and SenE. 

Transition from infectious (I) to resistant (R) 

The rate a at which a random infectious animal in a 
given cohort becomes a carrier was assumed to be con- 
stant in time. As such, the length of time t until an in- 
fectious animal becomes carrier can be modelled by an 
exponential distribution with rate parameter a. So, given 
that the animal is infectious at the start of time interval 
At, the probability pR of becoming carrier is pR = Pr 
(t < At) = 1-exp (-aAt) since x is exponentially distributed 
(recall that At = 1 was set for conciseness). As before, a 
random cohort effect r 2 j was added to allow for cohort 
heterogeneity in the data, to obtain pRj = 1-exp (-aexp 
(r2j)). The number of new carrier animals RneWj t at the 
end of time step t, is thus Binomial with parameters Ij t 
and pRj. Note that a single parameter a was utilised, de- 
scribing the rate at which a random infectious animal in 
an average cohort, becomes carrier, however, cohort vari- 
ability (not all cohorts are average) was allowed for 
through r 2 j, which in turn reduces uncertainty in estimat- 
ing a. The I to R transition was modelled as follows: 

Rnewjt ~ Binomial ^Ijt, pRj^ 
cloglog^pRj J = log(a) + r 2j 

Transition from resistant to infectious 

For this compartment of the model, the rate of infec- 
tious contacts v in a random carrier animal was assumed 
to be constant in time, where v is the transmission rate 
parameter for the transition from carrier to infectious. 
With similar arguments as in the S to I compartment, 



the number of infectious contacts per animal in time 
period At is Poisson distributed with mean vAt. Since 
this transition was actually a rare event (only happening 
three times in the entire study), the Poisson distribution 
can be used, as it approximates the Binomial when its 
probability parameter is close to zero. So if in cohort j, 
there are Rj t -i carrier animals at the end of the previous 
time step, the number of transitions from R to I in time 
step t may be modelled as a Poisson variable with mean 
Uj t = vRjt_iexp (r 3j ). More explicitly: 

Inewjt ~ Poisson (pj^J 

(\ (^) 
Pjt) = lo g( v ) + lo g( R jt-i) + r 3 j 

where: 

- Inewjt denotes the number of new infectious animals 
(that result from this transition) in cohort (j) at the 
end of the time step (t), 

- (ij t is the mean number of carrier animals that become 
infectious in the cohort (j) during time step (t), 

- v is the transmission rate parameter for the 
transition from carrier to infectious state, 

- Rjt-i is the number of carrier animals at the end of 
the time step (t-1) in cohort (j), and. 

- r 3; - is a cohort random effect that allows for cohort 
heterogeneity. 

Note that Rj t _i = 0 is possible, in which case log (Rj t _i) = 0 
was set. The argument for doing this is that the transmis- 
sion rate parameter v may be defined as the limit of Uj t /R jt . 
!as Rj tl goes to zero. As such, ignoring the random effect 
for a moment, Uj t /Rj t _i should tend to a constant (i.e. v) as 
Rjt-i goes to zero rather than infinity. Note that in our data, 
Rjt-i = 0 happened on 20% of occasions. In the hypothetical 
case that Rj t -i = 0 for the majority of time steps and co- 
horts, then this component of the model (i.e. the transition 
R to I) becomes redundant as there will ultimately be al- 
most no information with which to estimate the transition 
parameter. 

Cohort random effects 

As indicated above, random cohort effects were incorpo- 
rated into each transition step to allow for i) cohort het- 
erogeneity/variability in the data, ii) unobserved cohort- 
specific factors, iii) unobserved temporal effects in the S 
to I compartment. These effects were different for each 
transition under the assumption that any unobserved co- 
hort factors affect each transition in a different way. For 
the transitions S to I and R to I, these random effects 
also allow for factors which affect disease spread but 
which are not dependent on the animals themselves 
(such as contaminated environment, feed, water, etc.). 
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For the transition S to I, the cohort random effects 
were assumed to be time-varying and auto-correlated, 
and were modelled as: 

ry,t=i~ Normal (0, erf) 
ry, t ~ Normal (ry, t -i , of) 

where the cohort random effect (iy t ) for time step t de- 
pends on the previous cohort random effect at time (t-1). 
With this cohort time-dependent random effect any unob- 
served dynamic behaviour in the spreading of infection 
within cohorts was captured, such as that due to infected 
mice. 

For the transition I to R and R to I, the random effects 
were modelled as: 

r kj ~ Normal(0, of) , k = 2, 3 (6) 

where: 

- subscript / denotes cohorts and. 

- o\ is the variance of the unobserved cohorts effects. 

In a preliminary model building stage, a cohort time- 
dependent random effect, r 2jt was considered for the tran- 
sition I to R; however the results showed no improvement 
to the model fit. Note that cohort time-dependent random 
effects were not considered for the Poisson model of the 
transition R to I. The transition only occurred three times 
in the entire study and it would be unreasonable to try to 
estimate unobserved temporal effects from this. 

Model implementation 

The overall SIR model described above was implemented 
in a Bayesian framework and fitted using Markov chain 
Monte Carlo (MCMC). In this framework, parameters 
are treated as random variables whose "prior" distribu- 
tion expresses our uncertainty about their value before 
any data is observed. Prior distributions (priors) are com- 
bined with the observed data through Bayes theorem to 
produce the posterior distributions for each parameter 
(posteriors). The posteriors express the uncertainty about 
model parameters after data is observed and all statis- 
tical inference is based solely on the posteriors. MCMC 
is a numerical technique which produces samples of 
values that eventually converge (after a certain "burn- 
in" number) to samples of values from the posterior 
(distribution) of each parameter. 

There was no historical information with which to in- 
form the prior distributions of log (|3), log (a) and log (v), 
so Normal distributions with zero mean and a variance of 
100 were used, reflecting prior ignorance while avoiding 
the use of improper prior distributions [24]. For the sensi- 
tivity probabilities of both serological and bacteriological 
tests, a Beta distribution was used as a prior. Previous 



information about the sensitivity of both tests [11,12] was 
used to inform those Beta distributions: a mean of 0.49 for 
faecal culture and a mean of 0.68 for Danish mix ELISA 
were assumed, so SenC ~ Beta (48.5, 50.5) and SenE ~ Beta 
(58.5, 27.5) were specified. These priors have means 0.49 
and 0.68 respectively, and variances that match the range 
of possible values dictated by the findings of [11,12]. Speci- 
ficity was assumed to be 100% in both tests. The precision 
(i.e. the inverse of the variance) of the Normal distribution 
for each random effect was given a Gamma (0.5, 0.005) 
prior distribution (large mean and very large variance to in- 
dicate prior ignorance). 

The complete SIR model was implemented in the 
open-source statistical software WinBUGS [25]. Exactly 
100,000 posterior samples were collected after a 5,000 
sample burn-in to ensure convergence to the posterior 
distribution [26] . Two MCMC runs were performed, with 
different initial values, to ensure convergence and mix- 
ing. The samples were thinned by only collecting one in 
10 consecutive samples to eliminate autocorrelation in 
posterior samples (the R package "coda" [27] was used), 
so that in total we ended up with 20,000 samples. 
Convergence was assessed by inspection of trace-plots but 
also more formally using the Raftery and Lewis diagnostic, 
and the Gelman-Rubin R-hat diagnostic which should be 
sufficiently close to one if convergence has been achieved 
[28,29]. Mixing in the chains was assessed by comparing 
the Markov Chain (MC) error with the standard deviation, 
for each parameter. Ideally the MC error for each param- 
eter should be less than 5% of the standard deviation [30] 
for good mixing. 

Posterior predictive simulation was used for model 
checking as described by Gilks et al. [24]. This technique 
is effectively testing whether the observed data are ex- 
treme in relation to the posterior predictive distribution 
of the observations (i.e., the fitted model). The deviance 
was the measure adopted for comparison. The technique 
involves the calculation of a "p-value" which should not 
be extreme (close to 0 or 1) for good model fit. 

Calculations of the basic reproduction ratio (R 0 ) 

Samples from the posterior distribution of R 0 were 
calculated from those of (3 and a using the following 
formula [5]: 

Ro = "/» ( 7 ) 

where [3 is the transition rate from S to I, and a is the 
transition rate from I to R. 

Results 

Transmission parameters 

Results, in terms of summary statistics from the posterior 
samples, are shown in Table 1. Note that the posterior 
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Table 1 Summary measures of the transmission parameters and random effects variances from the Salmonella 
transmission in pigs SIR model 



Parameters Mean Standard deviation Quartiles Rhat 









2,5% 


25% 


50% 


75% 


97,5% 




p 


0.44 


049 


0.06 


0.20 


0.33 


0.52 


1.52 


1.0021 


a 


0.18 


0.02 


0.14 


0.17 


0.18 


0.20 


0.23 


1 .0009 


V 


0.02 


0.03 


0.0001 


0.006 


0.01 


0.02 


0.04 


1 .0009 


a? 


3.00 


1.80 


0.80 


1.76 


2.60 


3.77 


7.59 


1.0011 


°l 


0.02 


0.04 


0.002 


0.005 


0.01 


0.02 


0.096 


1 .0009 


°l 


6.64 


38.82 


0.003 


0.06 


0.08 


3.85 


44.44 


1.0010 


pND 


0.18 


0.03 


0.12 


0.16 


0.18 


0.20 


0.25 




Ro 


2.20 


1.25 


0.78 


1.46 


1.91 


2.56 


5.24 





Legend: p - transition rate from susceptible to infectious, a - transition rate from infectious to carrier, v - transition rate from carrier to infectious, o^ - variance of 
the random effects for the transition from susceptible to infectious, o\ - variance of the random effects for the transition from infectious to carrier, O3 - variance of 
the random effects for the transition from carrier to infectious, pND - probability of non-detection of infectious animals, R 0 - basic reproduction ratio. 



samples are MCMC samples from the posterior distribu- 
tion of each model parameter and inference is based on 
those samples. A point estimate, the standard error and 
the 95% credible interval for a parameter are, for example, 
calculated as the sample mean, the sample standard devi- 
ation and the sample 2.5% and 97.5% quantiles of the pos- 
terior samples for that parameter. 

The MCMC convergence was considered acceptable 
since the R-hat for all parameters (including random 
effects) was never above 1.01. The results of the model did 
not significantly differ when the parameters of the priors 
for the sensitivity tests were varied (increasing and decreas- 
ing them by 10%). 

The posterior distribution for transition rate a (I to R) 
was symmetric, but for the transition rate |3 (S to I) and 
v (R to I), the posterior distributions were asymmetric 
(Figures 1, 2 and 3). As such, the posterior median was 
chosen to best summarise the value of these parameters. 
The median for the transition rate |3 was 0.33, for a it 
was 0.18 while for v it was 0.01 (Table 1). The median of 
the variance of cohort random effects for the transitions 
I to R and R to I was close to zero, which implies that 
there was little significant variation between cohorts for 
these two transitions of the model. The median of the 
variance of the cohort-time dependent random effect for 
the transition S to I was 2.6 (95% credible interval [0.80; 
7.59]), meaning that the cohort random effect is signifi- 
cant for this transition (Figure 4). The overall model fit 
was satisfactory with a "p-value" of 0.24 implying no sig- 
nificant difference between posterior predictive simula- 
tions (predictions from the model) and observed data. 

Basic reproductive ratio (R 0 ) 

Summary statistics of the posterior distribution of the R 0 
parameter are shown in Table 1. The posterior median 
of R 0 was 1.91, with a 95% credible interval of 0.78 to 



5.24. A density estimate of the posterior samples of R 0 , 
which effectively describes the spread of Salmonella spp. 
in these three Danish pig herds known to be infected 
with Salmonella, is shown in Figure 5. For moderate to 
high within herd Salmonella prevalence, this Ro distribu- 
tion suggests that Salmonella Typhimurim can range 
from fading out scenarios to epidemic ones but most of 
the time the infection spread assumes an endemic form. 

Diagnostic test sensitivity (pND) 

Recall that this modelling framework includes the esti- 
mation of the probability of failing to detect infectious 




0 2 4 6 8 10 12 



P 

Figure 1 Posterior distribution of p. Legend: A) Plot of the 
posterior distribution for transmission parameter (3, which describes 
the rate of spread of Salmonella Typhimurium from susceptible to 
infectious animals; B) Boxplot of the posterior samples used to 
produce the plot where the thick line in the box reflects the median. 
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a 

Figure 2 Posterior distribution of a. Legend: A) Plot of the 
posterior distribution for transmission parameter a, which describes 
the rate of spread of Salmonella Typhimurium from infectious to 
resistant animals; B) Boxplot of the posterior samples used to produce 
the plot where the thick line in the box reflects the median. 
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Model time interval 
Figure 4 Posterior distribution of the B random effects for 
cohort two over time. Legend: Plot of the posterior distribution of 
the (3 random effects (time and cohort) for one cohort over time 
with the mean and 95% credible intervals. 



animals, pND, using both the data but also prior informa- 
tion about the tests sensitivity [11,12]. Figure 6 shows a 
density estimate plot of the posterior distribution of pND. 

Discussion 

In this paper field data were used from a study conducted 
in three Danish pig herds which were known to be infected 



B 




Figure 3 Posterior distribution of v. Legend: A) Plot of the 
posterior distribution for transmission parameter v, which describes 
the rate of spread of Salmonella Typhimurium from resistant to 
infectious animals; B) Boxplot of the posterior samples used to 
produce the plot where the thick line in the box reflects the median. 



with Salmonella Typhimurium [10]. The data was con- 
sidered sufficiently reliable to be used in estimating 
the transmission parameters of an SIR model describ- 
ing the evolution of the disease and from that to infer R 0 
for Salmonella Typhimurium in finisher pigs. 

In comparing the parameters to those found in other 
studies (whether simulation- or observation-based studies) 



B 



0 10 20 30 40 50 60 

R0 

Figure 5 Posterior distribution of the basic reproduction ratio 
(R0). Legend: A) Plot of the posterior distribution for the basic 
reproduction ratio (R0). The vertical line shows the threshold value 
R0= 1 where dissemination of the infection occurs; B) Boxplot of 
the posterior samples used to produce the plot where the thick line 
in the box reflects the median. 
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pND 

Figure 6 Posterior distribution of the probability of non-detection 
of infected animals (p/VD). Legend: A) Plot of the probability of 
non-detection of infected animals (pND) due to the test characteristics; 
B) Boxplot of the posterior samples used to produce the plot where 
the thick line in the box reflects the median. 



it is important to take into consideration that the time 
interval used in this study was two weeks whereas in 
past simulation studies it varies from one day [14,17] 
to one week [18]. As the transmission parameters are 
rates, however, they can be easily transformed to relate 
to different time steps. Although the herds used in 
the Kranker study [10] had moderate to high levels 
of Salmonella Typhimurium prevalence, the median 
estimates of the transmission parameters from this study 
are lower than those found in previous simulation studies 
[16,18]. The transition rate [3 from S to I is slightly higher 
when compared to the Lurette et al. study [18] although 
the other rates (a, v) are lower than the equivalent pa- 
rameters in that same study. So the application of our 
proposed modelling framework to these Danish herds re- 
sulted in estimates comparable to similar studies (note 
that this approach could easily be used with data from 
other countries). Moreover, the prevalence of Salmonella 
in finishing pigs in Denmark is known to be close to the 
average prevalence in the EU [31]. 

To the best of our knowledge this is the first study of 
transmission rate parameters for Salmonella Typhimur- 
ium in swine that estimates the parameters using field 
data and a Bayesian probabilistic approach incorporating 
random effects. 

Bias of the study 

Correct classification of the infectious status of the indi- 
vidual pig is difficult for Salmonella Typhimurium infec- 
tion, because the diagnostic tests currently used are 
imperfect [32-37]. Bacteriology lacks sensitivity given 



intermittent shedding of Salmonella by infected pigs, 
whereas using serology in individuals can be associated 
more with a past exposure to the agent than a current 
exposure; therefore it can lack specificity for detecting 
shedding animals. Positive serology also shows a delay 
between infection and expression, leading to some lack 
of sensitivity. When analysing the data, the lack of 
sensitivity was accounted for by: i) starting the ana- 
lysis when at least one infected pig per cohort was 
observed and ii) using the probabilistic framework to 
predict the infectious animals that were not detected 
with these tests, from appropriately informed distribu- 
tions based on the sensitivity of each test. 

For optimal estimation of transmission parameters, the 
time step between each sampling should preferably be as 
short as the average generation interval, spanning the 
time when one animal becomes infectious to the time 
when a second animal becomes infectious because of 
the first animal. The time steps in this data (two weeks) 
are not ideal - preferably a few days or perhaps one 
week would be better [13]. However as previously dis- 
cussed, the available data did not allow for such an op- 
tion and it would be very costly to obtain new data. As 
data from a published study [10] were used, the time 
step was set to be an approximation of the different 
testing intervals within and between cohorts, given the 
limitations offered by the original set of data, and an ap- 
proximation to the time of seroconversion [13,38]. This 
approximation could have affected the estimation of pa- 
rameters due to the extended time interval between test- 
ing occasions. Nevertheless, comparison with the results 
published in other studies does not seem to support this 
hypothesis. Concerning the cohorts, it is clear from the 
Kranker study [10] that particular attention was paid to 
the selection of the herds, which were taken from a large 
population of Danish finishing herds with a well-known 
status for Salmonella. This gave us confidence regarding 
the generalization of our results, at least for herd infected 
with the same serotype (S. Typhimurium). 

Transition parameters and R 0 values 

Note that the stochastic SIR model presented here is 
only a discrete-time approximation to the real transmis- 
sion dynamics, i.e. limited to bi-weekly intervals. In 
particular, when the number of susceptible animals is 
small and the infection intensity high, then the ex- 
pected number of infectious animals will tend to be 
overestimated [5]. 

The estimate of the transition rate (3 (from S to I) is 
low compared with other infectious diseases (such as 
swine influenza) and reflects the fact that in most of the 
herds, Salmonella does not cause outbreaks but main- 
tains a residual level of infection represented by infec- 
tious and carrier animals that enable the infection to 
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persist in the herds. The credible interval for the transi- 
tion parameter a (from I to R), matches the variability of 
shedding duration that is known from experimental and 
field studies [13,39]. The transition rate estimate v (from 
R to I) is small and possibly dependent on stress events 
(the authors of the Kranker et al. study [10] report that 
two cohorts in which animals began shedding in a 
second round had a slurry overflow which can be con- 
sidered a stressful event). The variance of the cohort 
time-dependent random effect was high and a possible 
explanation for that is the different management of co- 
horts which in turn induces high variability (between 
cohorts) in the transmission data. In future studies this 
should be taken into consideration as a way to minimize 
transmission of infection. 

For spread to occur, R 0 should be above one. Looking 
at Figure 5 we can see that there is high probability that 
R 0 > 1, 94% specifically. The median R 0 value was 1.91 
indicating that Salmonella Typhimurium was spreading 
in most of the cohorts. The value is not high (third quar- 
tile of R 0 is less than three) implying it would not spread 
rapidly through the susceptible populations under man- 
agement systems similar to the ones used in these herds. 
With lower probability, R 0 is high enough to cause out- 
breaks, e.g. probability that Rn > 5 is 2.5%. 

The Ro 95% Credible Interval (CrI) ranges from 0.78 
to 5.24. The higher values reflect that animals infected 
with a high infectious dose have a longer shedding 
period [13] than the ones infected with low infectious 
dose, and so the former can cause an outbreak. This 
makes sense because Salmonella Typhimurium is an 
agent that primarily spreads via the faecal-oral route. 

Few studies are available to aid in defining infectious 
animals, but the experimental and field studies con- 
ducted by some authors [13,15] support the duration of 
infectiousness used in our study. 

A next step in our investigation will be to include the 
estimated transmission parameters ([3, a, v) in a stochas- 
tic simulation model developed by the authors to simu- 
late the spreading of Salmonella Typhimurium in swine 
herds and thus test the effectiveness of different control 
strategies. 

Conclusions 

A Bayesian framework was proposed, to estimate Samo- 
nella Typhimurium transmission parameters, and has 
been successfully implemented using data from Danish 
pig herds. The model extends the current established 
methodology of utilising GLMs to implement stochastic 
SIR models. Random effects were added to i) explicitly 
allow for cohort heterogeneity in the data (i.e. allow for the 
fact that pigs were grouped in cohorts), ii) capture possible 
unobserved cohort effects and iii) avoid the problem of 
overdispersion. Results in terms of posterior samples allow 



for direct probabilistic statements about model parameters, 
which may be also used in other analyses such as simula- 
tion models for testing management strategies. 

The issue of underestimating infectious pigs due to 
testing sensitivity was addressed by predicting the number 
of non-detected pigs, using i) prior information about test 
sensitivity and ii) the observed data. In doing that, the 
probability of non-detection was treated as an unknown 
parameter which was estimated at the same time as the 
transmission parameters. 

All model unknowns (transmission parameters, cohort 
random effects, non-detected pigs, probability of non- 
detection) were estimated simultaneously, implying that all 
possible sources of uncertainty were modelled, in turn giv- 
ing more confidence about the estimates of the transmis- 
sion parameters. 
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